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ABSTRACT 

This paper presents both the result of a search for fossil systems within the XMM Cluster Survey 
and the Sloan Digital Sky Survey and the results of a study of the stellar mass assembly and stellar 
populations of their fossil galaxies. In total, 17 groups and clusters are identified at z < 0.25 with 
large magnitude gaps between the first and fourth brightest galaxies. All the information necessary 
to classify these systems as fossils is provided. For both groups and clusters, the total and fractional 
luminosity of the brightest galaxy are positively correlated with the magnitude gap. The brightest 
galaxies in fossil systems (called fossil galaxies) have stellar populations and star-formation histories 
which are similar to normal brightest cluster galaxies. However, at fixed group/cluster mass, the 
stellar masses of the fossil galaxies are larger compared to normal brightest cluster galaxies, a fact 
that holds true over a wide range of group/cluster masses. Moreover, the fossil galaxies are found to 
contain a significant fraction of the the total optical luminosity of the group/cluster within 0.5i?200i & s 
much as 85%, compared to the non-fossils, which can have as little as 10%. Our results suggest that 
fossil systems formed early and in the highest density regions of the Universe and that fossil galaxies 
represent the end products of galaxy mergers in groups and clusters. 

Subject headings: galaxies: abundances — galaxies: evolution — galaxies: formation — galaxies: 
groups: general 



1. INTRODUCTION 

Hierarchical models of structure formation predict that 
galaxy groups and clusters (hereafter referred to collec- 
tively as galaxy systems) form via the g ravitational in- 
fallof field galaxies and smaller systems ()White fc Reesl 
[19781 . Th e effects of dark e nergy will eventually halt 
the in- fall (Nagaminc fc l.ocH [201 )■{:■. at which time the 
system can be considered fully assembled. Dynamical 
friction will cause some of these galaxies to merge, re- 
sulting in the formation of a massive ga laxy near the 
base of the gr avitational potential (e.g. iBarnesI 119891 : 
IDubinskil[l99cl . A magnitude gap can then develop as 
the timescale for orbital decay by dynamical friction is 
inversely proportional to galaxy mass and so the more 
mass i ve galaxies will tend to merge first (jD'Onghia et al.1 
120051: iDariush et alll2010[ ). The cooling timescale of the 
intergalactic medium is longer, so the system will be sur- 
rounded by a halo of X-ray emitting gas. Such systems 
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have come to be referred to as 'fossil groups'. 

The most commonly accepted definition of a fossil 
group (|Jones et al. 2003) requires i) an i?-band magni- 
tude gap of 2 or greater between the two brightest galax- 
ies located within half the virial radius of the system, 
and ii) extended X-ray emission with a bolometric X-ray 
luminosity Lx,boi ^ 5 x 10 41 h^Q erg s -1 . The X-ray 
criterion guarantees the existence of at least a group-size 
halo while the optical criterion ensures (approximately) 
that there are no L* galaxies inside the radius for orbital 
decay by dynamical friction. 

Originally, the term 'fossil group' (or 'fossil galaxy 
group') referred to an apparently isolated galaxy, which 
due to a surrounding halo of X-ray emitting gas, 
was assumed to ha ve previously resided in a group 
(|Ponman et al.l fl994f ) . The word 'fossil' was used be- 
cause the merger history of the group was thought to 
be contained within this single galaxy. Since then, fos- 
sil groups (i.e. galaxies) have been found within actual 
galaxy groups and this has lead to a degree of confu- 
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sion and a gradual shift from 'fossil group' referring to 
the individual galaxy to the group as a whole. Addi- 
tionally, since the definition of a fossil group that has 
developed only sets lower limits on both the size of the 
magnitude gap and the X-ray luminosity (Lx), it is pos- 
sible to find fossil groups with properties that are more 
consistent with those o f galaxy clusters, as we do in 
this work (but see also iMendes de Oliveira et all 120061 
2009; [Khosroshahi et al.f l2006b). Therefore, we replace 
the word 'group' with the word 'system', so that 'fossil 
system' (FS) refers to a galaxy group or cluster that sat- 
isfies the above two criteria and 'fossil galaxy' (FG) refers 
to the most-luminous galaxy in a FS. This is the nomen- 
clature that we shall use throughout the remainder of 
this paper. 

The initial interpretation of FGs was that they rep- 
resent the end product of galaxy merging in groups 
or clusters. The magnitude gap in an FS was ac- 
counted for by an early formation epoch that allowed 
time for sufficient merge rs. The observed regular, sym- 
metric X-ray emission (jKhosroshahi et all 120071) and 
highly concentrated m ass profiles ( Jones et al.l 120031 : 
iKhosroshahi et all 120041 ) of FSs support this idea. Like- 
wise, simulations have shown that FSs form a large 
fraction of their mass at hig h redshift, and that 



they form earlier than non- FSs (ID'Onghia et al.| T2 005: 



iDariush et all [20071: Ivon B enda-Bcckm ann et al.l 
Additionally. [D'Onghia et al.1 (|2005| ) found a correlation 
between the magnitude gap at z — and the formation 
time of the fossil system in the sen se that early-formin g 
systems have larger gaps (see also IDariush et all 12010). 
Note, however, that in-fall can result in a smaller mag- 
nitude gap without the system necessarily being late- 
forming. 

Simulations by Ivon Benda-Beckmann et all (2008) 
suggest that the formation of FSs is primarily driven by 
the relatively early in-fall of massive satellites with the 
magnitude gap arising after the system has built up half 
of its final mass. The formation appears particularly ef- 
ficient if the in-fal l occurs along filamen t s with small im- 
pact parameters (ID'Onehia et al.l|2005l : iSommer-Larsenl 
120061: Ivon Benda-Beckmann et al.ll2008h . 

Studies of the FS luminosity function 
(IMendes de Oliveira et al.l 120061: iCypriano et all 120061: 
iZibetti et al.l 120091: lAguerri et al.l l2011h have found 
parameters that are consistent with the universal 
lumin osity function of clusters derived by iPopesso et al.l 
(2005), although they are deficient in ~ L* galaxies. It is 
ossible that FSs formed wit h a deficit of ~ L* galaxies 
Mulchaev fc Zablu doff 1999)). This alternate formation 
scenario interprets FSs as 'failed groups' in which the 
majority of the available gas was initially used up in a 
single luminous galaxy rather than in several. Except for 
this deficiency in ~ L* galaxies, the properties of FSs are 
similar to those of galaxy clusters. Their X-ray scaling 
relations and masses are more consistent with cl usters 
than groups (IMendes de Oliveira et al.l 120061 120091: 
IKhosroshahi et al.l l2006bf )~ leading to the suggestion 
that F Ss are merely representative of a n evolutionary 
phase (jvon Be nda-Beck mann et al.l 120081 ) — after 4 Gyr 
~ 90% of FSs in simulations a re found to have become 
non-fossil (|Dariush et al.ll2007l ). 

In one of th e only studies of its kind, 
iDfaz-Gimenez et al.l ((2008) looked at the merger 



history of FGs in simulated FSs within the Millennium 
Simulation Galaxy Catalogue. They find that, like 
BCGs, FGs are mainly formed by gas-poor mergers but 
that FGs are formed later than BCGs, i.e. they undergo 
mergers at lower redshifts, despite the fact that FSs 
assembled most of their virial mass at higher redshifts in 
comparison with non-FSs. No age differences were found 
between t he stellar populations in FSs and bright field 
ellipticals ([La Barbera et ail 120091 ). which is consistent 
with a later formation if the mergers were gas-poor. 
Numerous studies have been made of the growth of 
brightest cluster galaxies (BCGs) and the results are 
conflicting. Simulations performed in an hierarchical 
context predict that BCGs are the result of multiple 
mergers an d therefore should contin ue to grow until 
z ~ 0.5 (e.g. lDe Lucia fc Blaizot]l2~007[ ). Observationally, 
howev er, BCGs are found to evolve passively since 
z ~ 1 (IBrough et al.ll200"l I27M IStott et al.l l27)08[ W^l 
iWhilev et al.ll2008HCollins et al.ll2009t) . 

Defining FS samples is difficult due to the need for 
both good quality X-ray and optical data, and so most 
of our understanding has come from simulations. The 
initial observational studies focused on small samples 
or single objects and so the number of known F Ss re- 
mains small (see IMendes de Oliveira et all [2006, for a 
good summary). Enabled by the large quantity of 
data produced b y surveys such a s the Sloan Digital 
Sky Survey (SDSS lYork et al.ll2006T ). more recent studies 
have focuse d on defining and a nalysing larger samples 
of FSs (e.g. iSantos et al.1 120071: lLa Barbera et"al1 120091: 



iVoevodkin et al.l 120101: iMillcr et al. 1 1201 ID . but in most 
cases these samples lack the high-quality X-ray data nec- 
essary to ensure that the definition of an FS is met and 
often the criteria have been relaxed to such an extent 
that the possibility of substantial contamination is high. 

The purpose of this study is two-fold. Firstly, we want 
to take the high q uality X-ray data from the XMM Clus- 
ter Survey (XCS: [Romer et al.|[200lt iLlovd-Davies et al.l 
[2^TllMeh7tens et al.ll20ilT) " and combine it with the opti- 
cal data from the SDSS Data Release 7 (DR7) to produce 
a secure sample of FSs; we choose purity over quantity. 
Secondly, we want to compare the stellar mass assem- 
bly and the stellar populations of FGs to various other 
samples, and to examine the X-ray scaling relations of 
FSs. 

The XCS is a serendipitous search for galaxy clus- 
ters using all publicly available data in the XMM Sci- 
ence Archive. Its main aims are to measure cosmolog- 
ical parameters and trace the evolution of X-ray scal- 
ing r elations. The first d ata release from XCS (XCS- 
DR1 iMehrtens et al.ll201lL hereafter Mil) contains 503 
clusters of which 402 have measured X-ray temperatures 
(Tx) and luminosities. The serendipitous nature of XCS 
is a big advantage because it allows the detection of 
smaller systems, which have a higher probabili ty of satis- 
fying the optical criterion in the FS definition ([Cui et al.l 
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The layout of the remainder of the paper is as follows. 
We outline the samples used in Section [21 including a 
review of the definition of an FS and our methodology 
for searching for them. In Section [3] we describe the 
various data-sets utilised in this study. The results of our 
study into the stellar masses and the stellar populations 
of fossil galaxies are presented in Section [4] while Section 
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[5] presents the results of our study of the X-ray scaling 
relations and stellar mass assembly of FGs and BCGs. A 
discussion of all these results can be found in Section [6] 
and our conclusions are presented in Section [7l 

We assume a ACDM cosmology with Hubble parame- 
ter Hq = 70 km s _1 Mpc -1 , dark matter density parame- 
ter fl\ — 0.73, and matter density parameter VLm = 0.27. 

2. SAMPLES 

In this section we outline our methodology for search- 
ing for FSs within the XCS and the SDSS and present our 
final sample of 17. We also discuss two samples of BCGs, 
one optically selected and one X-ray selected, which we 
use as comparison samples. We begin by making a few 
comments on how FSs are defined. 

2.1. The Definition of a Fossil System 

The m ost common l y use d definition of an FS is that 
found in Uones et al.l (|2003f ) . which is reproduced in the 
Introduction. This definition is motivated by physical 
arguments but is not beyond modification. The X-ray 
criterion was set to only select galaxies that reside in 
at least a group-sized halo and so does not have much 
scope for modification, the adopted magnitude gap, on 
the other hand, does. 

This is beca use studies of the magn itude gap in 
both clusters (iMilosavlievic et al.l 120061 ) and groups 
(jTavasoli et al.ll2011h have shown a wide range of gaps. 
The average magnitude gap between a 0.1 < z < 0.2 
BCG and t he second br i ghtes t cluster member is found 
to be 0.5 ()Pipino et al.l 120111 see also Loh & Strauss 
2006), with poor systems exhibiting a larger average 
gap. Statistically, it is easier to get large magnitude 
gaps when the number of galaxies in the system is low 
(IDariush et al.l l2007t iCui et al.l 1201 ll h In simulations, 
iDariush et al.l (|2007l ) found that the strongest X-ray FS 
candidates are those with the highest X-ray luminosity as 
these systems are not expected to have a large luminosity 
g ap entirely by chance . 

IDariush et al.l (|2010( ) have suggested, based on numer- 
ical simulations, that a magnitude gap of 2.5 between the 
brightest and fourth-brightest galaxies is a better indica- 
tor of an FS. They find that the 2 magnitude gap is better 
at finding high- mass systems but the 2.5 magnitude gap 
finds 50% more early-forming systems and those that are 
in the fossil phase longer. 

The definition of a fossil system th at we adopt here is a 
combination of the X-ray criterion oflJones et al.l and the 
optical criterion of IDariush et al.l ()201Cll h Therefore, for 
a system to be classified as a fossil by us it needs to have 
Lx ^ 5 x 10 41 /iy 2 erg s _1 and a magnitude gap of 2.5 in 
the r-band between the brightest and the fourth brightest 
galaxies located within half the virial radius, which we 
denote as Ami4. The virial radius is approximated with 
the radius at which the average density is equal to 
200 times the critical density of the Universe, and Lx 
is the bolometric X-ray luminosity inside this radius. In 
Appendix [A] we investigate how our results change if the 
standard defin ition of an FS is adopted, i.e. replacing th e 
IDariush et al.l magnitude gap with that of lJones et al.l 

2.2. The Fossil System Sample 

Previous searches for fossil systems began by looking 
for galaxies that satisfied the magnitude gap criterion 



and then tried to match them to an X-ray detection. 
Here we take the opposite approach and examine the 
galaxies associated with an extended X-ray source to de- 
termine the magnitude gap. 

We begin with the Zoo DR7 and Zoo candidate cat- 
alogs as described in Mil. In addition, we also allow for 
XMM Pi-targeted cluster observations to be included in 
our final sample (such systems are noted in Table [1}. 
Thus, our catalog of FSs contains serendipitously dis- 
covered FSs, as well as previously known groups/clusters 
which have been re-classified as FSs. Since we are study- 
ing the stellar populations of FGs, we also required that 
the brightest galaxy near the center of each candidate 
have a measured SDSS spectrum and we used the red- 
shift of that galaxy to define the redshift of the system. 
These constraints complicate the FS selection, and so for 
this work we avoid making any conclusions which require 
a known selection function (e.g., constraining the number 
density of FSs). 

We then identified FSs by examining the color- 
magnitude relation around each candidate. We used a 
system similar to the XCS-Zoo that is described in de- 
tail in Mil. The main difference being the addition of 
color-magnitude diagrams (CMDs) to help with the es- 
timation of the magnitude gap. Specifically, we visually 
examined X-ray images and optical color images from the 
SDSS overlaid with X-ray contours. These X-ray and op- 
tical images were created over 3' x 3', 6' x 6', and 12' x 12' 
fields of view. At the same time, we compared the im- 
ages to the r vs r — i CMDs using SDSS DR7 imaging 
data. 

A system was either designated an FS candidate or not 
based on the inspection of these images and CMDs. To 
be classified as an FS candidate the system must satisfy 
the following criteria: 

• Optical: A bright elliptical galaxy at or near the 
location of the X-ray source. 

• X-ray: Obviously an extended source, e.g. rather 
than a blend of point sources (see Mil). 

• CMD: Am i4 > 2.0 in the 3' x 3' image. 

It has recently been noted that the SDSS photometry 
system atically underestimates the luminosities of nearby 
BCGs (jBernardi et al.ll2007t lLauer et all 12007! ) with the 
discrepancy being a function of BCG radius. This prob- 
lem affects DR7, which we use, and is best described in 
the "Imaging Caveats" page of DR80. Therefore, we use 
a magnitude gap of only 2.0 when selecting the FS candi- 
dates, instead of the adopte d gap of 2.5, to allow for th e 
correction of this problem (jvon der Linden et al.ll2007[ ). 
All sources were examined by at least three co-authors 
and those that were classified as an FS candidate at least 
twice (~ 60 in total) were then examined more closely. 

Defining magnitude gaps in clusters requires accurate 
measures of -R200, Lx, and the magnitudes of the galax- 
ies, as well the ability to reject foreground/background 
galaxies. Therefore, the FS candidates that passed the 
above initial criteria were then put through the XCS 
analysis pipelines in order to measure Tx and Lx (see 

1 http:/ /www. sdss3.org/dr8/imaging/caveats.php 



4 



Harrison et al. 



Section 13.11 for more details). After doing so, we re- 
calculated the CMDs using the estimated i?200 values 
(specifically, out to 0.5i?2oo)- We discuss the robustness 
of our sample and results to changes to -R200 m Appendix 

E 

We also applied background corrections to the magni- 
tudes of th e large central galaxi e s usin g the algorithm 
defined in Ivon der Linden et al.l (|2007fl and tested in 
iVoevodkin et all (12008) . The average size of this cor- 
rection is 0.3 mag but could be as large as 1.0 mag. 
We note that ma ny existing F S sam ples that rely on 
SPSS data (e.g. iSantos et all 120071 : (La Barbera et al.l 
120091 iMiller et all 12011ft havenot made this important 
correction. This would result in incorrect FG luminosi- 
ties (which, as we shall see in Section [SJ is a significant 
fraction of the total system luminosity), smaller magni- 
tude gaps, and lower levels of completeness. 

Only then were we able to measure the magnitude gap 
from the brightest central galaxy out to some physical 
distance. In estimating the magnitude gap we only con- 
sidered galaxies that were within ±0.2 from the r — i color 
of the central galaxy. This is reasonably generous since 
the scatter in the red sequence is found to be less than 
half this amount. We also discarded galaxies from the 
CMDs that had SDSS spectroscopic redshifts > 2000 km 
s _1 away from the cent ral galaxy. For thos e galaxies with 
a photometric redshift (jCsabai et al.ll200"7l ) , we discarded 
galaxies beyond Az p hot =0.1. This cut is conservative 
since the average photometric redshift error for galaxies 
with m r ~ 19.7 is 0.04. No difference in the number of 
systems classified as fossils was found when this photo- 
metric redshift cut was doubled to Az p /j t = 0.2. This 
resulted in a clean (but possibly incomplete) sample of 17 
FS. This iterative procedure is important and it allowed 
us to rigorously check which of the 60 FS candidates were 
in fact true FSs. 

The positions of these systems in the parameter space 
used to define FSs is shown in Figure Q] The red cir- 
cles are our sample while the blue squares are the XCS- 
DR1 groups/clusters of Mil (see Section T2.3I for more 
details) . The horizontal and vertical dashed lines denote 
the optical and X-ray criterion respectively; points that 
lie above and to the right of these lines are classified 
as FSs. There are two systems that have Am^ < 2.5, 
which we accept as FSs. The system with the arrow 
is XMMXCS J124425.9+164758.0 (hereafter referred to 
simply as J124425.9), which has an FG with a double nu- 
cleus. This makes the sky-subtraction correction highly 
uncertain and so we elect not to apply it. If we were 
to apply the estimated correction of 1.0 mag then Ami4 
changes from 2.3 to 3.3, which is denoted by the thin ar- 
row in Figure [T] Even if we only apply the average sky- 
subtraction correction of the other 16 FSs (0.3 mag) then 
Ami4 = 2.6 and so we accept J124425.9 as an FS. The 
other system is XMMXCS J172010.0+263724.7, which 
actually has a magnitude gap Am u = 2.45 and is close 
enough to 2.5 that we accept it as an FS too. Our re- 
sults do not change if these two FSs are excluded. We 
note that four FSs were targeted for observation by XMM 
and are thus not a part of the XCS-DR1, which includes 
only serendipitously discovered clusters. These targeted 
systems are labelled in Table [T] and our conclusions are 
unaffected if we exclude them from our analyses. 

For three FSs, there are galaxies near 0.5i?2O0, such 
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Figure 1. The distribution of our sample (red circles) and 
the XCS groups/clusters (blues squares) in the parameter space 
used to classify FSs. Points that lie above and to the right of 
the dashed lines are classified as FSs. Two systems that have 
Ami2 < 2.5 are classified as FSs. The system with the arrow 
is XMMXCS J124425.9+164758.0, which has a highly uncertain 
sky-subtraction correction that we elect not to apply. If we ap- 
ply the correction (of 1.0 mag) then this results in a magnitude 
gap of 3.3, as denoted by the thin arrow. The other system 
is XMMXCS J172010.0+263724.7, which has a magnitude gap 
Amy = 2.45 and is close enough to 2.5 that we accept it as an FS. 
The three thick arrows indicate how the magnitude gap changes 
when the error on -R200 is taken into account. For all other sys- 
tems the magnitude gap is unaffected by the errors on i?200- 

that a slightly smaller -R200 excludes one or more of them 
as members or a slightly larger -R200 includes one or more 
of them as members. If the galaxy is amongst the four 
brightest in the system then this can have the effect of 
increasing or decreasing the magnitude gap, respectively. 
We show the magnitude of this systematic effect in Figure 
[T] (thick arrows) after changing -R200 by one standard 
deviation of its statistical error (see Section 13. 4|) . The 
magnitude gaps for the other 14 FSs are unaffected by 
the statistical uncertainties on i?2oo- 

Details of the FSs are given in TableQ]and details of the 
FGs are given in Table [5] Descriptions of the individual 
systems, along with CMDs and SDSS images with XCS 
contours overlaid are given in Appendix [B] 

2.3. The BCG Samples 

One of the main aims of this study is to characterise the 
stellar populations and the stellar mass assembly of FGs 
and to compare them with those of BCGs in non-fossil 
systems. For this purpose we use two samples of BCGs, 
one optically selected and one X-ray selected. For a fuller 
description of the various catalogs on which the samples 
were based the reader is referred to the references below. 

The first sample of BCGs consists of optically-selected 
galaxies drawn from two ca talogs: the C4 BCG catalog 
(Ivon der Linden et al.l [2007) and the maxBCG catalog 
(|Koester et al.ll2007a|) . We use both these catalogs be- 
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Table 2 

The details of the fossil galaxies in our sample. The ID matches a galaxy to a system in Table [T] L ga i is the luminosity of the FG in 
10 11 Lq; M* is the stellar mass of the FG in 10 12 Mq; the age of the FG is in Gyr; Z is the metallicity of the FG as a percentage; and 
SSFR is the specific star-formation rate in yr _1 . Luminosities and z are taken from the SDSS; M,, age and Z are from STARLIGHT; and 

SSFR are taken from the MPA-JHU database. 
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cause, combined, they fully cover the redshift range of 
our FGs. The second sample of BCGs is drawn from the 
XCS (Mil). We will refer to the former sample as the 
optical BCGs, the latter as the XCS BCGs, and, when 
combined, simply as BCGs. There are considerably less 
XCS BCGs than optical BCGs, but they are drawn from 
an X-ray selected cluster sample and therefore are useful 
in assessing any bias that may result from comparing an 
optically-selected sample (the optical BCGs) to an X-ray 
selecte d sample (the FGs). 

The Ivon der Linden et all C4 catalog is based on the 
IMiller et all (120051) C4 catalog but utilises a better al- 
gorithm for identifying the BCG. It is derived using the 
SDSS DR4 spectroscopic sample and it identifies clusters 
in position, redshift, and color parameter space assuming 
that a fraction of the cluster galaxies form a red sequence. 
The C4 catalog contains 625 BCGs at z < 0.1. 

The maxBCG algorithm for cluster detection relies 
on two characteristics of galaxy clusters: the brightest 
galaxies in a cluster occupy a narrow region of color- 
magnitude space (the red sequence) and the brightest 
galaxy in the cluster is located near the centre of the 
galaxy distribution. Galaxies are designated as BCGs 
based on the product of two likelihoods. The first is the 
likelihood that the galaxy is spatially located in an over- 
density of galaxies with similar g — r and r — i colors. The 
second is the likelihood that it has the color and magni- 
tude properties typical of BCGs. The algorithm can be 
run on any catalog of galaxies and doing so on the SDSS 
database results in a catalog consisting of 13823 BCGs 
with redshifts 0.1 < z < 0.3, which is over 85% complete 
for halos with masses above 1 x 10 14 M©. To match 
our FS sample, we restrict the maxBCG catalogue to 
z < 0.25. 

XCS-DR1 (Mil) consists of 503 optically confirmed X- 
ray clusters at z < 1.46, 402 of which have published X- 
ray temperatures and luminosities. From these we draw 
a sample of clusters that are within the SDSS footprint, 
have an observed spectrum, and have z < 0.25 (the red- 



shift of our highest-redshift FS), for use as a comparison 
sample. For each cluster a BCG was identified from an 
SDSS image and in the cases where the BCG was not 
obvious the brightest galaxy closest to the X-ray posi- 
tion was selected. The positions of these clusters in the 
parameter space used to define FSs is shown in Figure [TJ 
Note, in defining the magnitude gap for the XCS BCGs 
we used the same method that was used in defining the 
FS magnitude gaps. 

There are 39 XCS clusters that satisfy the above three 
criteria and these clusters have a flat distribution in red- 
shift space very similar to that of the FSs. The optical 
BCGs have a distribution that peaks at z ~ 0.25, there- 
fore, to minimize any evolutionary effects, we randomly 
resampled the original sample of 14448 BCGs (C4 and 
maxBCG combined) to create a sample that has a flat 
distribution over the same redshift range as our FSs, re- 
sulting in a final sample of 2687 optical BCGs. 

3. DATA 

In this section we describe the various datasets (X- 
ray, optical, and stellar population parameters) that were 
used in this study. We conclude the section with a dis- 
cussion of the errors on the various quantities. 

3.1. X-ray Data 

The X-ray (Tx and Lx) and -R200 data used in 
this paper come from the XCS-DR1 (Mil). The XCS 
is a serendipitous search for galaxy clusters using all 
publicly available data in the XMM Science Archive 
(jRomer et al.1 12001). The procedures used for measur- 
ing Tx and Lx, and for estima t ing ife oo are outlined in 
Section 4 of iLlovd-Davies et al.l (|201lL hereafter LDllfl 
so we give only a brief description here. Tx was measured 
by performing spectral fitting to background-subtracted 
spectra with xspec. The best-fit was det ermined us - 
ing the maximum likelihood Cash statistic (|Cashlll979l ). 

2 Actually, the process for estimating Lx,500 an d R500 is de- 
scribed, but it is similar to that for estimating Lx,200 an< i ^?200- 
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Lx was measured by extrapolating surface brightness fits 
to a simple o ne-dimensional, spherically symm etric, /3- 
profile model (jCavaliere fc Fusco-Femianolll976l) . i? 2 oo is 
then estimated from the lArnaud et all (|2005t ) ?Y-i?9nn 
relation. 

3.2. Optical Data 

All galaxy magnitudes are SDSS DR7 Petrosian mag- 
nitudes that have been extinction corrected , k-corrected 
(kcorrect v4.1.4; IBlanton fc Roweii 120071), and sky- 



subtraction corrected (jvon d cr Lind en et al.l 12007). To- 
tal optical luminosities of groups/clusters (L to t) are esti- 
mated from all galaxies that are: 1) less than 0.2 redder 
and 0.2 bluer than the r — i color of the BCG; and 2) 
dimmer than the BCG and within a radius of 0.5i?2oo 
to an absolute r-band magnitude of —19.67 (or 20.8 at 
z = 0.25). Galaxies with spectra are restricted to those 
within 2000 km s _1 of the cluster, while those without 
are restricted to Az p hot — 0.1, where we use the z p hot 
from the SDSS DR7. These selection criteria are identi- 
cal to those used in defining the magnitude gaps of the 
systems (see Section 12. 2[) . The radius of 0.5i?2oo was 
chosen as it matches the radius used to define an FS and 
changing it to i?2oo has no effect on the results. In addi- 
tion to the optical data, spectroscopic redshifts (z) of the 
FGs/BCGs are also obtained from the SDSS and used to 
define the redshift of the system. 

3.3. Stellar Population Data 

The SDSS has provided galaxy spectra in unprece- 
dented numbers, which has allowed for statistical stud- 
ies of the star-formation histories and stellar populations 
of various galaxy samples to be performed. The by- 
products of these efforts have been numerous new meth- 
ods for estimating the associated parameters, such as 
ages, mctallicities and elemental abundances etc. Com- 
monly, these data are stored in public databases for gen- 
eral use or the method is made public to allow it to be 
applied to different samples. 

Each of these methods approach what is essentially 
the same task in a variety of ways. Thus, each 
method has it own set of strengths and weaknesses. In 
this study w e utilise two different metho ds/databases: 
STARL IGHT (ICid Fernandes et al.l I2005D and MPA- 
JHU (IKauffmann et al 1 120031: IBrinchmann et al.l 120041: 
iTremonti et al.l l2004t iSalim et al.ll2007t) . Brief outlines 
of both are given below but for more details readers are 
referred to the references provided. We obtain stellar 
masses (M„), ages, and metallicities (Z) from starlight 
and specific star-formation rates (SSFR) from MPA- 
JHU, which also provides stellar masses that we use to 
test the robustness of some of our results. We chose to 
use the Starlight data over the MPA-JHU data for two 
reasons: the diffu sion map method of se lecting a popula- 
tion basis used in lRichards et al.l ()2009D has been shown 
to reduce the age/metallicity degeneracy; and we note 
that the MPA-JHU did not release ages nor metallicities 
beyond DR4, which is a much smaller sample than DR7. 

3.3.1. Starlight 

The population synt h esis code starlight 
(|Cid Fernandes et al.l 12004 I2005D fits an observed 
spectrum with a linear combination of single theoretical 



stellar populations (coeval and chemica l ly hom ogeneous) 
computed with the IBruzual fc Chariot! (120031) evolution- 
ary synthesis models. Ric hards et al.l (|2009D show that 
the accuracy of this technique is highly dependent on the 
choice of input basis of simpl e stellar popula tion (SSP) 
spectra. Thus, we use the IRichards et al.l diffusion 
map methods to choose appropriate bases of prototype 
SSP spectra. These methods allow us to approximate 
the continuous grid of age and metallicity of SSPs of 
w hich galaxies ar e truly composed. As shown in detail 
in Richar ds et al.l . we obtain robust stellar ages, stellar 
metallicities, and stellar masses and at the same time 
significantly reduce the degeneracy between age and 
metallicity. We correct the stellar masses for the fact 
that they are observed within a 3" diameter fiber using 
the same correction as applied to the MPA-JHU stellar 
masses (see Section 13.3.21) . In Table O we provide the 
results of these fits for stellar mass, luminosity-weighted 
ages and stellar mass- weighted metallicities. 

3.3.2. MPA-JHU 

The MPA-JHU team has publicly released catalogs of 
derived physical properties for a sample of SDSS galax- 
ies, bu t the only property tha t we make use of is the 
SSFR (|Brinchmann et aT1l2004D . although we make use 
of their stellar masses to estimate an average error for the 
starlight stellar masses. The catalog is being recre- 
ated for the SDSS DR7, for which there are properties 

for 818333 galaxies. 

The method of IBrinchmann et al.l (|2004[) to estimate 
star-formation rates is built on the methodology of 
iCharlot et all (120021) and the mo delling of emission 



lines by Chariot fc Longhettil ()2001[ ) , which combine the 
IBruzual fc Charlotl ( 19931) galaxy evoluti on models wit h 
the emission line modelling from cloudy (Ferland 1996). 
Dust attenuation is the major sou rce of uncertainty in es- 
timating star-formation rates and Brinchman n et al.l fo l- 
low the dust treatment outlined in lCharlot fc F all ( 2000) , 
making an initial estimate of dust attenuation based on 
Ha/H/3. A grid of model Ha line-strengths (with a par- 
ticular dust attenuation applied) are compared with the 
observed spectrum and the be st match selected using a 
Bayesian approach similar to IKauffman n et all (|2003|) . 
Ha luminosity is th e n con verted to a SSFR following 
ICharlot fc Longhettil ([2001). The estimated SSFR of 
each FG is given in Table [3] 

Stellar masses are estimated by multiplying the dust- 
corrected luminosity of the galaxy by the stellar mass- 
to-light ratio predicted by their best-fi t model using the 
Baye sian approach mentioned above (jKauffman n et aLl 
2003). They extrapolate the mass-to- light ratio and the 
dust attenuation values estimated within the SDSS fibre 
to obtain total stellar masses. 

3.4. Errors 

Derivation of the X-ray data errors is described in 
LD11 and we will only give a summary here. For Lx 
and Tx the parameter is stepped (both in the nega- 
tive and positive directions) from its best-fit value un- 
til the fit statistic increases by the amount required for 
the confidence region needed (i.e. 68%). The i?200 errors 
wer e calculated by prop agating the Tx errors through 
the lArnaud et al.l (|2005l ) Tx-i?200 relation. 
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Errors on L ga i, L to t, £24, an d the magnitude gaps are 
from the SDSS photometry and when converting to so- 
lar luminosities both the error on the absolute g-band 
magnitude (0-02 mag) and the solar g-r color (0.02 mag; 
Bili r et all [2001 are propagated through. In addition 
to these statistical errors we also derive errors on A77114 
due to the errors on i?200- This only affects the magni- 
tude gap in three out of the 17 FSs and in none of the 
three does it negate the system's classification as an FS. 
These systems have this error provide in parentheses af- 
ter the statistical error in Table [TJ In some cases, the 
errors on -R200 a lso induce additional errors on the total 
optical luminosities. In half of our sample, the error on 
i?200 does not add or subtract any galaxies into the to- 
tal optical luminosity. In the other half, the luminosities 
do change and these have the largest errors in Table 2. 
These systematic luminosity errors are still small when 
compared to the total optical luminosity of the system 
which is dominated by a single very bright galaxy. How- 
ever, the fractional error on the £24 can be quite large 
and one-sided. 

There are no errors on age and metallicity because 
Starlight does not provide any. MPA-JHU does pro- 
vide errors on their SSFR but we do not use them since, 
as for age and metallicity, we are comparing the SSFR 
distributions and not individual systems. We estimate 
average stellar mass error from the scatter in the dif- 
ferences between the Starlight and MPA-JHU stellar 
mass estimates, which encodes the scatter introduced by 
the use of two distinct statistical techniques and two dif- 
ferent SSP models. We use these average errors (0.047 
dex for the FSs and 0.079 dex for the XCS BCGs) in 
relevant plots but do not provide them in Table [2] 

4. STELLAR POPULATIONS 

If FGs formed at a different epoch to BCGs or followed 
a different evolutionary path then evidence of this may 
be found within their stellar populations. In this section 
we use the stellar masses, ages, and metallicities from 
STARLIGHT, and the SSFRs from the MPA-JHU database 
to compare the stellar populations of the FGs with those 
of the BCGs and investigate this issue. 

We start by simply comparing the stellar masses of 
the three samples, the distributions of which are shown 
in Figure [2j The dashed lines mark the median mass 
for each of the samples. From this plot it can be clearly 
seen that FGs are amongst the most massive galaxies in 
the Universe as, on average, they are more massive than 
both samples of BCGs. In fact, ~ 80% of the FGs are 
more massive than the average BCG. The median stellar 
mass of the FGs is (M») = (7.6 ± 1.2) x 10 11 M©, while 
optical BCGs have (M*) = (4.1 ± 1.9) x 10 11 M© and 
XCS BCGs have (M») = (3.7 ± 0.1) x 10 11 M©, i.e. FGs 
are roughly twice as massive as BCGs. We note that the 
same results are found when the MPA-JHU photometric- 
based stellar masses are substituted for the starlight 
spectroscopically-based stellar masses. 

Is this difference between the FG and the BCG stel- 
lar masses significant? A simple statistic to use would 
be the Students t-test, however, this test compares the 
samples individually and assumes that the stellar masses 
are drawn from a normal distribution. The Welch varia- 
tion of the t-test is applicable if the sample variances are 
unequal or the sample sizes are different. If the distribu- 
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Figure 2. The STARLIGHT stellar mass distributions for the vari- 
ous samples showing that FGs are amongst the most massive galax- 
ies in the Universe. The dashed lines mark the median mass for 
each of the samples. 

tions are not normal but they are similar (homoscedas- 
tic), then the Mann- Whitney U-test is appropriate for 
individual comparisons. Unfortunately, our data are not 
homoscedastic and we need to make multiple tests simul- 
taneously, therefore none of these standard techniques 
are desirable. 

We are interested in a joint comparison of the masses 
from distributions which are not Gaussian and from sam- 
ples of widely varying sizes (see Figure H} . Thus we use 
mu ltiple hypothesi s testing (or simultaneous inference — 
sec 1 IShaffer I (p95h for an excellent review on this sub- 
ject), since if not accounted for, the multiplicity can 
result in an over-estimation of statistical significance. 
Since we are comparing samples that have different sizes 
and distributions (i.e., the random variables are not het- 
eroscedastic) , the t-test-family is not appropriate. In- 
stead, we take the most robust statistical approach and 
control the family-wise error rates for all pairwise com- 
parisons of group differences via the max t-test using 
a heteroscedastic consistent covariance e stimation (see 
iHerberich etal~ll20loj: iRichardson |[20TTh . On a techni- 
cal note, we utilize single-step procedures to adjust the 
p- values for their multiplicity, meaning that the order of 
the tests is not important. 

We find that the stellar masses of the FGs are sig- 
nificantly higher than those of the optical BCGs and the 
XCS BCGs. The probabilities that the FGs and the opti- 
cal (XCS) BCGs have the same stellar masses is < 0.005 
(0.003) at the 95% confidence level. We also find that 
there is no difference between the stellar masses of the 
XCS and optical BCGs. We note again that these statis- 
tics are robust, in that no assumptions regarding the 
distribution, sample sizes or variance homogeneity have 
been made. Had we not accounted for the heteroscedas- 
ticity of the samples, our reported probabilities would 
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Figure 3. The scaling relations of SSFR (top), age (middle) and 
metallicity (bottom). The red circles are the FGs, the blue squares 
are the XCS BCGs, and the density contours are for the optical 
BCGs. The density contour levels are 5, 10, 20, 50, 100, and 200 
galaxies per bin. The stellar populations of the FGs are found to 
be consistent with those of the BCGs. 



have been more than 10 times smaller (and our signifi- 
cance much higher) . Had we used the Mann- Whitney li- 
test (which assumes that the distributions of the masses 
are the same in the different sets), we would have also 
reported 10 times smaller probabilities (and higher sig- 
nificances). 

Performing the same statistical test on the stellar pop- 
ulation parameters as the stellar masses we find that 
there are no differences between any of the three sam- 
ples in their distributions of SSFRs, ages and metallici- 
ties, results which ar e confirmed by K-S testin g. Similar 
results were found bv lLa Barbera et al.l (|2009f ). although 
when comparing FGs to a sample of bright field ellipti- 
cals. No differences were reported between the ages and 
metallicities of both these samples. 

We show the distribution of the three stellar popula- 
tion parameters as a function of stellar mass for all three 
samples in Figure where red circles are the FGs, the 
blue squares are the XCS BCGs, and the density con- 
tours are for the optical BCGs. This is for the sake of 
clarity since there are ~ 2700 optical BCGs. The density 
contour levels are 5, 10, 20, 50, 100, and 200 galaxies per 
bin. The fact that FGs are amongst the most massive 
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Figure 4. The Lx-Tx relation for the FSs ( red ci rcles). Also plot- 
ted are the FS dat a fro m Khosroshahi ct al. ( 200 ^, gre e n tria ngles) 
and Proctor ct al. (2011, blue squares ). the l W u et al.l 1119991) clus- 
ter data (black crosses), and the Osmond & Ponman (2004) group 
data (black stars). FSs are found at all Lx and Tjf except those 
of the most massive clusters. The absence of FSs at low Lx and 
Tx is due to the Lx cut-off used in the definition of an FS. 

galaxies in the Universe is highlighted by these distri- 
butions where they mostly hug the massive edge of the 
BCG distributions. 

Using a 2D, two-sample K-S test to compare the opti- 
cal BCGs to the XCS BCGs (i.e. X-ray selected) we find 
no difference between any of the stellar population pa- 
rameters, which allows us to compare the X-ray selected 
FGs to the optically selected BCGs and take advantage 
of their larger sample size. Comparing the stellar popu- 
lations in FGs with this sample we find that the probabil- 
ity that their SSFRs and ages were drawn from the same 
distribution is < 0.005 and < 0.01 for their metallicities. 
Given that ID test above found no differences between 
the stellar population parameters of all three samples, we 
suggest that this confirms the earlier result that, on aver- 
age, FGs are significantly more massive than BCGs and 
that the differences found in the 2D testing are driven by 
these differences in mass. 

In summary, we have performed a careful statistical 
analysis of the differences between the FG stellar masses 
and stellar populations, and those of the BCGs. Our 
analysis takes into account the differences in the sam- 
ple distributions and sizes, as well as the multiplicity of 
the tests. We find that the FGs have significantly higher 
stellar masses than the BCGs but similar stellar popula- 
tions. In the following sections, we explore the cause of 
this stellar mass growth in the FG with respect to the 
magnitude gap in the FS. 

5. STELLAR MASS ASSEMBLY 

We begin this section by looking at the X-ray scal- 
ing relations of the FSs and comparing them to those 
of the XCS BCGs. In Figure H we show the L x -T x 
relation of the FSs (red circles) . Also plotted are the 
FSs fromlKhpsroshahi et al.l (|2007l green triangles) and 
iProctor et al.l (|2011l , blue squares), the galaxy group data 
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Figure 5. The optical luminosity of the galaxy system within 
0.5-R200 as a function of Lx- There is no difference between the 
FS relation and that of the XCS clusters. Red circles are FSs and 
blue squares are XCS clusters. 

from lOsmond &: Ponmanl (120041 black s tars), and the 
galaxy cluster data from lWu et al.1 (|1999l black crosses). 
From this figure we see that FSs fall on the same relation 
that both galaxy groups and clusters do. In fact they are 
found at all Lx, ranging over three orders of magnitude, 
except those of the most massive clusters. There is an 
absence of FSs at low Lx, which is due to a selection 
effect, i.e. the Lx cut-off used in the definition of an FS. 
The lowest-Lx FS, which has too large a Tx for its Lx, 
is XMMXCS J030659.8+000824.9 (hereafter J030659.8) 
and we note that its classification as an FS is uncertain 
for reasons given in Appendix [B] 

In Figure [5l we examine the optical luminosity of the 
galaxy systems within 0.5i?2oo (Ltot) as a function of Lx- 
Red circles are FSs and blue squares are XCS clusters. 
In a comp rehensive study of t he X- ray scaling relations 
of FSs by iKhosroshahi et~aT1 (|2007l) . the authors found 
evidence that FSs were boosted in L x for a given optical 
luminosity (see also lSantos et al.ll2007t ). which is also pre- 
dicted by in the N-body simulations of iD'Onghia et all 
(2005) . However, ot her studies have found no difference 
(jAguerri et al.ll20lTI) and have concluded that there is a 
possible systematic difference between the studies or that 
the difference is real but only for less massive systems. 
We find no evidence for boosted Lx in our data, rather 
we find that FSs, over more than three orders of magni- 
tude in Lx , populate this diagram in the same way that 
non-FSs do. 

The optical luminosity of the FSs is strongly correlated 
with X-ray luminosity (r$ = 0.92, p < le — 7) and the 
intrinsic scatter in \og(L tot ) is only 0.14 ±0.02 dex. Sim- 
ilar results are found for the XCS clusters, however, the 
intrinsic scatter is slightly larger. Substituting Tx for 
Lx (Figure [5]) produces an even tighter relation, the in- 
trinsic scatter in log(Lt D i) is reduced to 0.10 ± 0.01 dex, 
although the correlation is slightly weakened (rs = 0.85, 
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Figure 6. The optical luminosity of the galaxy system within 
0.5-R2O0 as a function of Tx- The intrinsic scatter about the FS 
relation is greatly reduced both when compared to the FS relation 
with Lx and the XCS cluster relations with Lx and Tx- Red 
circles are FSs and blue squares arc XCS clusters. 

p < 0.0001) due to a larger number of outliers. Again 
similar results are found for the XCS clusters. 

The X-ray temperatures are used to infer the radii 
within which we calculate the total optical luminosity 
of the systems. So it is reasonable to expect there to 
be some induced correlation between Tx and L to t- For 
example, as we scatter upwards in X-ray temperature, 
we would of course infer a larger radius which to include 
the optical light from member galaxies (but never less). 
We tested the amplitude of this correlated scatter by ex- 
amining the change in the fraction of light as we change 
i?200 based on the Tx errors (±ler). We find that half of 
the FSs have optical luminosities that are identical after 
changing the radius. The other half have scatter in their 
L t ot that correlates with scatter in Tx ■ These 8 systems 
have the largest L tot errors in Table 2. However, the 
slope of the fit to the fractional scatter in L tot versus Tx 
in these systems is only 0.3. For a given system, if the 
Tx scatters up by 100%, the optical luminosity scatters 
up by ~ 30%. In Figure O we see that the relationship 
between total optical luminosity and X-ray temperature 
is much steeper than this correlated scatter. For this 
reason and the fact that half the FS have uncorrelated 
scatter between Tx and L tot , we conclude that the real 
scatter in Figure [6] is not much larger than our measured 
scatter (see also Appendix [A]) . 

In Figure [7] we show the stellar mass of the dominant 
galaxy (i.e. an FG in an FS or a BCG in a non-FS) as 
a function of Tx- The red circles are our FGs, the blue 
squares are the z < 0.25 XCS BCGs, and the gree n trian- 
gles are the z ~ 1 BCGs from lStott et~aTl pOlOh . While 
the fits to the two BCG samples are consistent with each 
other (the green dashed and blue dotted lines) the fit to 
the FGs is offset to higher mass for a given Tx (the red 
solid line). In Figure [8] we present the same information 
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Figure 7. The variation of the stellar mass of the dominant galaxy 
with Tx showing that, for a given Tx, FGs are amongst the most 
massive galaxies in the Universe. The red circles are the FGs, the 
blue squares are the XCS BCGs a nd the green triangles are the 
2-1 BCGs of IStott etaLl | |2uT0h . The red solid line is the fit 
to the FG data, the blue dotted line is th e fit to the XCS BCGs, 
and the dashed green line is the fit to the Stott et al. BCG data. 
The black dot-dashed line shows the slope (1.72 ) of th e power law 
relation between M500 and Tx from[Stott et al. (2010), which has 
been normalized to the FG fit at Tx =3.0 keV. 

as in Figure except we represent the stellar mass of 
the dominant galaxy by its optical luminosity and the 
system mass by the total optical luminosity (see Section 
13.31) . Since we do not have stellar masses for all our FGs 
nor BCGs we make use of the r-band luminosity which 
we expect to be a good proxy for stellar mass, to maxi- 
mize our sample sizes. Just as in Figure [71 the FGs sam- 
ple the upper edge of the BCG distribution and similar 
to Figures [5] and [5] the scatter in the FGs is remarkably 
small. 

We hnd that there is a strong correlation between Tx 
and Af* (r s = 0.76, p < 0.001) but that none of the stel- 
lar populations parameters are found to be correlated. 
These results show that, while the mass of the system 
hosting the FG determines how massive it can become, 
it has little effect on the formation and evolution of the 
stars that compose it. Similar results are found when 
substituting Tx with Lx and the significance of the re- 
sult is unaffected after excluding the two systems with 
Amu < 2.5. 

There are two results in Figure [7] The first is that FGs 
are more massive than XCS BCGs for any given Tx, and 
so the mean stellar mass of the ensemble of FGs has to 
be higher than non-FGs. This explains in our results 
in Figure [5] in Section 21 where we found that the mean 
stellar mass of the FG sample is significantly higher than 
the non-FS samples. 

Figure [7J shows that, at any given Tx, FGs have the 
highest stellar mass. There are, however, a few XCS 
BCGs that have large stellar masses and are not classified 
as fossils. We note that these exceptions all have A77114 
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Figure 8. This figure is similar to Figurc[7]but now we show the 
variation of the optical luminosity of the dominant galaxy with the 
total optical luminosity of the system. As in Figure [7] the FGs lie 
at the upper edge of the BCG distribution. Red circles are FGs 
and blue squares are BCGs. 



of ~ 2, which is close to the FS classification threshold. 
There is one FG that lies well below the rest but we have 
reason to doubt the X-ray emission in this system and 
we refer the reader to Appendix IB. 21 for a discussion. 

The second result from Figure [JJ is that the z ~ 0.1 
FGs are ~ 100% (~ 40%) more massive than the z ~ 1 
BCGs at 1 keV (10 keV). The fits to the low- and high- 
redshift BCG samples are consistent with each other. 
Therefore, if the z ~ 1 BCGs are the progenitors of 
the XCS BCGs, which are at z < 0.25, then this figure 
shows that BCGs do not have any significan t growth at 
z 5, F consistent with previous studies (e.g. IStott et al.1 
2010). However, if BCGs from either sample are the 
progenitors of FGs, then the BCGs need to grow by 
~ 40 — 100% to reach the average mass of a FG at 
z ~ 0.1. Semi- analytic models of early-type g alaxy for- 
mation in clusters (e.g., iDe Lucia et al.l 120061 ) . suggest 
that F Gs form via gas-poor me rgers even at low red- 
shifts (jDi'az-Gimenez et "all 120081 ) . despite the fact that 
FSs assemble most of their virial mass at higher red- 
shifts. There is some observational evidence that sug- 
gests BCGs contin ue to grow through dissi pationless 
mergers even today (jBrough et al.l [20051 120111 ) and sim- 
ulations predict that a large fractio n of the BCG mass 
today was accrued in the last 5 Gyr (|De Lucia fc Blaizotl 
l2007t iRuszkowski fc Sprmg cl 2009). Our results suggest 
that neither the high- or low-redshift BCGs have evolved 
and that neither are the progenitors of FGs, unless sig- 
nificant major merging events by galaxies well outsid e 
the systems have yet to occur (e.g. lBrough et al1l2011l ). 

The black dashed line shows how the mass of the un- 
derlying system (i.e., the group or cluster mass) scales 
with temperat ure via the M-T x relation (core-excised) 
from Table 1 in Maughan] (|2007D . Both the stellar masses 
of the dominant galaxies and the total masses of the 



The Stellar Mass Assembly of Fossil Galaxies 



11 




1 2 



Am 14 (mag) 

Figure 9. The highly-significant correlation between Amy and 
the stellar mass of the dominant galaxy. Red circles are FGs and 
blue squares are XCS BCGs. The solid line is the fit to the data, 
both the FGs and the XCS BCGs. The data point with the arrow 
is J124425.9 (see text for details). 

groups / clusters scale with Tx , but their relationships are 
decoupled. 

We can see from Figure [§] that the mass of the dom- 
inant galaxy is strongly correlated with the size of the 
magnitude gap. The question is whether the magnitude 
gap and the stellar mass of the galaxy are part of a cause 
and effect relationship: are the FG stellar masses large 
because the magnitude gap is large? These two quanti- 
ties are found to be positively correlated with high sig- 
nificance (r s = 0.58, p < 0.00001). The solid line is the 
fit to the data, both the FGs and the XCS BCGs. The 
data point with the arrow in this plot (and the following 
plots) is J124425.9. The FG in this system has a double 
nucleus and so we elect not to apply the sky-subtraction 
correction; the arrow denotes the position of this system 
in this plot if we did apply the correction. 

In Figure \W\ we examine the fraction of total optical 
luminosity contained within the dominant galaxy as a 
function of the magnitude gap Ami4. Red circles are 
FGs and blue squares are XCS BCGs. The trends seen 
in this figure are still present after we split the sample 
into high-mass systems (with Tx > 2 keV — dashed line 
and open symbols) and low-mass systems (Tx < 2 keV — 
dotted line and filled symbols). The magnitude gap bins, 
within which the medians were calculated, were chosen to 
contain approximately equal numbers and are as follows: 
0.9-1.29, 1.3-1.79, 1.8-2.49, 2.5-2.99, and 3.0-3.5. The 
data point with the arrow is J124425.9 and it is not use 
to calculate the median in this or any following plots. 

The fraction of total optical luminosity and the mag- 
nitude gap are highly correlated (rs — 0.80, p = (ac- 
tually 2e — 12)). There is a large spread in the fraction 
of light contained within the dominant galaxy, especially 
for the FGs. The values range from 30% all the way up to 
85%, consistent with the values that have been reported 
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Figure 10. The fraction of total cluster light contained within the 
dominant galaxy as a function of Amy. The FGs can contain a 
significant fraction of the total light of a system (up to 85%) while 
for the XCS BCGs this fraction is generally lower. Red circles 
are FGs and blue squares are XCS BCGs. The dashed line shows 
the median fraction for low-mass systems (open symbols; Tx < 2 
keV) and the dotted line is for high-mass systems (filled symbols; 
T x > 2 keV). The data point with the arrow is J124425.9 (see text 
for details). 




T x (keV) 



Figure 11. The fraction of the total optical luminosity in the 
system contained within the dominant galaxy as a function of Tx ■ 
Again, the FGs lie at the upper edge of the BCG distribution and 
form a tight sequence. The red circles are the FGs and the blue 
squares are the XCS BCGs. 
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Figure 12. Dominant galaxy luminosity as a function of Amy. 
This plots shows that luminous dominant galaxies are likely to be 
found in systems with large magnitude gaps. The red circles are 
the FGs and the blue squares are the XCS BCGs. The dashed line 
and the dotted line are the medians of the low-mass and high-mass 
systems. The data point with the arrow is J124425.9 (see text for 
details). 

in the li terature (lAguerri et al.l (|201 lh found a value of 
15% and I Jones et al.i (|2000D 70%). This spread is caused 
by the strong correlation that exists between the dom- 
inant galaxy light fraction and the mass of the system 
(|Lin fc Mohrll200l . which we show in Figure [TT] using 
Tx as a proxy for system mass, and the lack of a correla- 
tion between Amu and system mass. In this figure red 
circles are FGs and blue squares are XCS BCGs. Similar 
to Figures [7] and [8j the FGs sample the upper edge of 
the BCG distribution and have a much smaller scatter. 

Looking at the trends with system mass, in Figure [lOl 
it appears that there is a bi-modality at high magnitude 
gaps with a cleaner separation between the high-mass 
and low-mass systems. FGs in low- mass systems contain 
a larger fraction of the total optical luminosity of the sys- 
tem compared to the FGs in low-mass systems. However, 
our data are insufficient to be able to make any definite 
claims. The high-mass outlier is J030659.8, which was 
also an outlier in Figures 0] and [7] 

In Figure [12] we show how the optical luminosity of 
the dominant galaxy changes with magnitude gap and in 
Figure fT3l we show how the total optical luminosity of the 
second-to-fourth brightest galaxies (EL24) changes with 
magnitude gap. The symbols and line styles are the same 
as those used in Figure [TUJ From Figure [TJ] we see that 
the overall trend is for the luminosity of the dominant 
galaxy to increase as the magnitude gap grows. These 
two quantities are significantly correlated (rs — 0.52, 
p < 0.00005). Similar to Figure fTUl we see the suggestion 
of a bi-modality at high magnitude gaps. In Figure [13J 
SL24 is either constant or slightly decreasing suggesting 
that very little merging (and certainly no major merging) 
is occurring amongst these galaxies. As in Figure[T0l the 
high-mass outlier is J030659.8. 
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Figure 13. SL24 as a function of Amy. The light contained 
within EL24 remains constant or decreases as the magnitude gap 
increases. The red circles are the FGs and the blue squares are the 
XCS BCGs. The dashed line and the dotted line are the medians 
of the low-mass and high-mass systems. The data point with the 
arrow is J124425.9 (see text for details). 

6. DISCUSSION 

The debate about what FSs truly represent is far from 
over. Whether they are the end-point of mergers in 
groups/clusters or not is a contentious issue. If FSs do 
not represent a distinct class of objects then the cause 
of their magnitude gap, which is their distinguishing fea- 
ture, needs to be addressed. Two of the leading pos- 
sibilities are an early formation epoch coupled with a 
favourable configur ation for the rapid and efficient accre- 
tion of companions (Pones et apl2003t iKhosroshahi et all 
2001 ID'Onghia et all 120051: IKhosroshahi et all I2007t 
Dariush et al. I I2007t I von B enda-Beckma nn et al.l 2008) 
and the failed group scenario (|Mulchaev fe Zabludofil 
[1991 . 

The results of this study provide some answers to this 
question. The observed facts that: 1) our FSs exist at all 
system masses; 2) our FSs are not deficient in L* galaxies 
(see the CMDs in Appendix iBj); and 3) the FGs have the 
largest stellar mass of any cluster BCG (as a function of 
the system mass), all rule out the failed group scenario. 
These are normal groups and clusters with populated red 
sequences but with an extremely massive BCG. 

The correlations with magnitude gap (Figures [HI EH 
[T2l and fT5|) support the idea that the gap is an indi- 
cator of the evolutionary phase of an FS, with a large 
gap indicative of an early-forming system that contains 
an FG that has undergone many small mergers. This 
observati onal result is ent i rely co nsistent with the simu- 
lations of D'Onghi a et al.l ([2005). Therefore, any model 
of FS formation and evolution must do the following as 
the magnitude gap grows: 

• The luminosity of the FG must increase. 

• The fraction of the total optical luminosity of the 
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system contained in the FG must increase. 

• The total fraction of light contained in the second- 
to-fourth brightest galaxies must not increase. 

If repeated mergers within the system are responsible 
for the above points then there are two likely scenarios 
and the fact that the luminosity contained within the 
second-to-fourth brightest galaxies stays constant with 
magnitude gap is important. Firstly, the FG may grow 
by accreting many smaller galaxies leaving the second- 
to-fourth brightest galaxies untouched. This scenario 
de-emphasises dynamical friction and favors fast and effi- 
cient mergers with galaxies with low impact parameters. 
It is supported by the fact that in all our FSs only two 
(J030659.8 and J123338.5) have fourth brightest galaxies 
that are less luminous than L* . In all other systems the 
FG is so luminous that it is more than 2.5 mag brighter 
than L*. Secondly, the FG could undergo fewer merg- 
ers with smaller galaxies and then develop the magni- 
tude gap in a single equal-mass merger. This requires 
that the second-to-fifth brightest galaxies be of similar 
luminosity, so that when one is accreted the luminosity 
contained within the second-to-fourth brightest galaxies 
remains roughly constant. Both of these scenarios imply 
that the merging is more efficient in FSs than non-FSs, 
i.e. a higher fraction of the stellar mass is retained in the 
FG. 

For the cluster-sized FSs there is an additional forma- 
tion scenario. The FG could form in a group where the 
relative velocities are conducive to merging, before ac- 
creting (or being accreted by) another system. This sec- 
ond system, however, must be lacking in bright galaxies 
otherwise the magnitude gap would vanish and the sys- 
tem would no longer qualify as a FS, ma king this mode 
of form ation unlikely, yet not impossible. S chirmer et al.l 
(2010) found J0454-0309 to consist of two systems, a 
sparse cluster and an in-falling FS, the latter they believe 
will seed the FG. They also find, outside a radius of 1.5 
Mpc, two filaments that extend over 4 Mpc. In this study 
we find two systems (J141627.7 and J141657.5) that are 
possibly interacting and represent a possible example of 
this scenario. 

If only one process leads to the development of an FS 
then it must be possible for this process to occur in a sys- 
tem of any mass, i.e. we have FSs ranging in mass from 
groups to clusters. Looking at Figures [TU1 [T2l and [TBI the 
case could be made for two different formation mecha- 
nisms: one in the high-mass systems and another in the 
low-mass systems. Although our data is not definitive 
we will nonetheless speculate about this possibility. 

It is possible that the high-mass FSs represent those 
that evolved to have a magnitude gap via mergers while 
the low-mass FSs are those that were formed with a mag- 
nitude gap. The systems that evolved to be FSs would 
contain FGs that have undergone numerous mergers and 
so would be expected to have large masses. They would 
also exist in rich systems and so the chances of them 
developing large magnitudes gaps would be small. The 
systems that were formed as fossils, on the other hand, 
would contain FGs that had not undergone many (if any) 
mergers and so would be relatively less massive. Being 
poor systems they are statistically more likely to have 
larger magnitude gaps. 



The results from Figure [7] suggest that z ~ 1 BCGs 
cannot be the progenitors of low-redshift FSs without a 
major merger or an abnormal number of minor mergers 
at z < 1. According to the fits in this figure, at 3 keV, an 
average FG has M* = 8x 10 11 M Q and an average BCG 
has M»=5x 10 11 M . The FG is therefore 60% more 
massive than the BCG. If a BCG was to undergo a single 
major merger to form an FG then this would require a 
merger ratio of less than 1:1.6 (i.e. a nearly equal- mass 
merger). However, a z ~ 1 galaxy undergoes a single 
major merger (merger ratios 1:1—1:4) in ^ 8 Gyr and a 
single minor mer ger (merger ratios 1:4—1:10) in ~ 3 Gyr 
(|Lotz et al.ll2011l ). The lookback time from z = 0.15 (the 
mean redshift of the FG sample) to z ~ 1 in ~ 6 Gyr. 

Essentially the FG must be formed from the merger 
of two BCGs, a rare event at low redshift. In the SDSS 
90% of all major me rgers occur between ga laxies that 
are fainter than L* (pattern fe Atfieldl 120081) . To have 
two BCGs merging requires the merger of two clusters- 
another rare event at low redshifts. 50% of massive 
halos at z = had t heir last major merger at z > 1 
(jFakhouri et aljfeOlOf) . The above considerations sug- 
gest that FS are not a phase of normal gr oup/cluster 
evolution (jvon Benda-Beckmann et~a l. 2008). 

Closely linked to the formation of the magnitude gap 
is the formation of the FG. Why are they amongst the 
most massive galaxies for a given system mass? Are 
FGs formed later than BC Gs in non-FSs through ga s- 
poor mergers (dry mergers IDiaz-Gimenez et al.l [20081? 
Are they formed via g as-rich mergers (wet mergers 
IKhosroshahi"et~al . 2006b)? Or is it a combination of the 
two-wet mergers early with most o f their mass being as- 
semb led later through dry mergers dMendez-Abreu et al.l 
12011 ). The results of our stellar population analysis, 
which found no differences between FGs and BCGs, rules 
out late-time wet mergers. However, it is possible that 
the stellar population models we are using are unable to 
resolve such small differences. 

The secret to determining the formation method of 
FGs and FSs may lie in the diffuse stellar component 
(DSC). The DSC is an important component of a clus- 
ter's overall luminosity and is composed of material that 
has been stri pped from cluster galax i es during dynamical 
interactions (jFeldmeier et al.l 120021: IRudick et all [20061 
2009). Anywh ere from 10-40% of a system's total lu- 
minosity can be found in the DSC, with the quantity 
increasing with time due to mergers. It is likely that 
there is no universal DSC fraction, but that different 
groups/clusters will have different DSC levels, depend- 
ing on their specific evolution and history (|Murante et al.l 
[2001 IRudick et al.l 120061 : IConrov et al.ll2007l) . Also, dif- 
fering dynamical interactions are found to create distinct 
structures within the DSC. For example, tidal streams 
are associated with fast, close encounters between a 
galaxy and the dominant galaxy. The DSC can provide 
a wealth of information on the dynamical h istory of both 
the d ominant galaxy and the cluster itself ()Rudick et al.l 

unnD. 

The detection of structure in the DSC would indicate 
recent mergers and that the FG is still rapidly evolving 
at z = 0.2 (e.g. iBrough et all 120111 ) . The lack of struc- 
ture (but the existence of DSC) would indicate that the 
FG has not evol ved th rough major mergers since z > 1 
iStott et alJ (e.g. 120081) . On the other hand, a lack of 
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DSC altogether would indic ate that the FG forme d its 
mass without major mergers iMulchaev fc Zabludofj (e.g. 

EH!)). 

7. CONCLUSIONS 

Using X-ray data from the XCS combined with opti- 
cal data from the SDSS we have defined a sample of 17 
FSs at z < 0.25, of which 14 have not been classified as 
such previously. This catalog represents not only an in- 
crease in the number of known FSs but also an increase 
in the quality of the X-ray data used to study such sys- 
tems. Using the data from XCS we examine the X-ray 
scaling relations of FSs. For the FGs, we estimate stel- 
lar masses, ages, and metallicities using starlight and 
obtain star-formation rates from MPA-JHU. Using these 
data compare the stellar mass assembly and the stellar 
populations of FGs to two samples of BCGs, one opti- 
cally selected and one X-ray selected. The main results 
from this paper are as follows: 

1. Fossil systems, i.e. systems with a large magnitude 
gap, have masses that range from those of galaxy 
groups to those of galaxy clusters. 

2. At fixed halo mass, the stellar mass of the dominant 
galaxy in FSs is larger than those in non-FSs. 

3. The fraction of light in the dominant galaxy, as 
well as the luminosity of the dominant galaxy, in- 
creases with magnitude gap for all galaxy groups 
and clusters. 

A scenario whereby FSs form at high redshift and FGs 
grow to high masses through fast and efficient mergers 
could explain most of the results in this paper. A study 
of the intracluster light in both FSs and non-FSs could 
help to decide whether the above explanation is plausible 
or not. 
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Table 1 

The de t ails of the fossil systems in our sample. The ID matches a system to a fossil galaxy in Table [2] Ami2 and Amy are the 

Jones ct al. (2003) and Dariush ct al. (2010) magnitude gaps (quoted errors are from the photometry, while those in parentheses are those 
that arise due to the errors on -R200); ^200 is in Mpc; Tx is in keV; Lx is the X-ray luminosity measured inside R200 in 10 44 /i^" n ergs -1 ; 
Ltot is the total r-band luminosity inside O.5R200 i n 10 11 Lq\ and SL24 is the total r-band luminosity of the second to fourth brightest 
galaxie s in 10 11 Lp,. FS references: 1-IDi'az -Gimcncz ct al. (2008); 2- Jones ct al. (2003); 3-Cypriano ct al. (2006); 4- Khosroshahi et al.l 
(2006al) : 5 4Khosroshahi et al.1 ll2006bl) r6 lVoevodkin et al.l (120101 ); 7- IProctor et al.1 (120111) ; 8- ISantos et~alTll2007l) . a The system was an 
XMM target. b The magnitude of the FG has not been corrected so the magnitude gap of this system is a lower limit; see Appendix IB. 101 

for details. 
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APPENDIX 

A. THE ROBUSTNESS OF THE FOSSIL SYSTEM DEFINITION 

The most-commonly used definition of an FS was first set out in lJones et a l. (2003, the exact definition is given in 
the Introduction). For a system to be classified as an FS in this study it must have Lx,ho\ ^ 5 x 10 41 /if 2 erg s _1 and 
a magnitude gap of 2.5 in the r-band between the brightest and the fourth brightest galaxies located within half the 
virial radius, which we denote as A77114. As mentioned in the Introduction, one of the main reasons why there are so 
few confirmed FSs in the literature is the lack of high quality X-ray data. Low S/N X-ray data makes it extremely 
difficult to not only detect extended sources but to also estimate their luminosity. It is for this reason that many 
samples are defined as 'optical fossil' samples, i.e. samples that are only known to satisfy the optical criterion. 

The source detection algorithm used in the XCS, the XCS Automated Pipeline Algorithm (Xapa), has made this 
project possible. Many difficulties can arise when trying to detect and measure X-ray sources, which usually have 
low counts: a PSF and sensitivity that varies over the instrument's field-of-view; deblending of point and extended 
sources; and background determination to name but a few (LD11, Mil). However, once in possession of accurate Lxs 
determining whether or not a system satisfies the X-ray criterion is a straight-forward matter. 

Determining whether or not the optical criterion is satisfied, however, is a more complicated matter. To be able to 
determine the magnitude gap of a system requires an estimate of the virial radius and redshift information for each 
of the galaxie s, which is re quired to determine whether a galaxy is a member of the system or not. Many FS studies 
start with the lJones et al.l definition and then modify it in various ways depending on circumstances. For example, if 
no estimate of the virial radius is available then a fixed aperture may be used. 

St udies have been perf ormed on the effect the adopted definition of the magnitude gap has on the properties of FS 
(e.g. iDariush et al.l[2010l ) but to date no study has been done on the robustness of the FS definition. In this appendix 
we test the robustness our FS classifications by varying some of the parameters used during the search for FSs, e.g. 
the size of the redshift cuts. We also investigate how th e results of th is study change when we 1) u se different rad ii to 
define the magnitude gap of our systems and 2) use the lJones et al.l magnitude gap instead of the IDariush et all 

In this study we relied on the redshift data provided by the SDSS to determine membership. Specifically, any galaxy 
with a spectroscopic redshift less than Acz — 2000 km s _1 or a photometric redshift less than Azphot =0.1 away from 
the potential FG was considered a system member. Given that the average photometric redshift error for our sample 
was 0.04, this cut is quite a generous and errs on the side of caution. The magnitude gap was then calculated based 
on the galaxies that populated the red sequence of the systems CMD, within ±0.2 mag in color of the potential FG. 

To test the robustness of our classifications we varied some of these values to see how that would affect the classifi- 
cations. We varied the spectroscopic- redshift cut in steps of 250 km s _1 up to 2000 km s _1 and the search radius in 
steps of 0.25i?2oo from 0.25i?2oo to i?2oo- We also considered two values for the color cut, 0.15 and 0.2 mag, and the 
photometric redshift cut, Az = 0.1 and 0.2. 

We find that the FS definition that we use is here is robust to both the spectroscopic-redshift and photometric- 
redshift cuts, and the color-cut. The only quantity that it depends on is the size of the search radius. In all cases, 
reducing the search radius from -R200 to 0.25i?2oo increases the magnitude gap, in some cases doubling it. Our sample 
is inherently robust to variations in the radius, since for many (more than half) of our systems we could have increased 
the search radius well beyond 0.5i?2oo and the system would have still been classified as an FS. We also examined the 
effects on our measured magnitude gaps (and therefore FS classification) of the errors on i?2oo- Systems whose gaps 
change when varying -R200 within the errors are noted in the individual notes. These changes in i?2oo would not reduce 
our measured Lxs below th at r equired to define a FS. 

As mentioned in Section [2T2l our i?2oo s are estimated from the TX-R200 relation found in lArnaud et al.l ((2005), 
which, although not as good as actually measuring i?200j is perfectly reasonable. There are, however, other methods 
of estimating i?2oo when a direct measurement is not possible. Here we investigate the effects on our results that the 
choice of -R200 has by repeating our analysis using four different estim ates of -ffeon: a fixed .35 Mpc aperture; a fixed 
0.5 Mpc aperture; an i? 2 oo estimated from t he -/V200-R200 relatio n of Uohnston et al.l (|2007l ); and an i? 2 oo estimated 
from the Tx-i?2nn derive d from the data in IMantz et al.l (|2010l ). In Figure IATI we compare th e -Rbqq-Tx r e lation 
from lArnaud et al.l ([20051 . solid line) to the relation determined from the Tx and -R500 data from IMantz et al.l (|2010t 
dotted line). The stars are the data used in this paper (open for the XCS clusters and closed for the FS). Open 
circles, triangles, and square s are the BCS. REF LEX, MACS clu sters respecti vely, where we show the core excised 
temperat ures and radii from IMantz et al\ (|2010f l. The fit to the IMantz et al.l d ata produces a slightly larger -R500 
than the lArnaud et al.l fit (~ 20% at Tx = 1-0 k eV). We obta in the Mantz et al.l -R200 estimates by scaling the -R200 
estimates from lArnaud et al.l by the ratio of the Ma ntz et al.l and lArnaud et al.l -R500 estimates. We find that this 
small shift in the R,2oo~Tx scaling relations only weakly affects our sample definition and induces only a small amount 
of scatter into relevant figures. Similar effects are found when using the fixed 0.35 Mpc apertures. 

The other two estimators, the fixed 0.5 Mpc aperture and the A^oo estimate, produce larger effects. Compared to 
our sample of 17 FSs, the former finds only 13 and the latter finds 21, i.e. the number of FSs found at these extremes 
is ±25% of that found in this study. The reason for the decrease in the case of the fixed 0.5 Mpc aperture is due to 
the fact that almost half of our sample have 0.5i?20O less than 0.5 Mpc. The search radius is therefore larger than 
necessary and includes more galaxies, which results in a smaller magnitude gap. The reason for the increase in the 
case of the .R200 estimates from the -/V200--R200 relation is most likely due to the fact that true FSs, if they are the 
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end-products of galaxy merging in groups/ clusters, should have an -R200 that is too large for their richness compared 
to that of normal groups/clusters. Therefore if such a relation, based on normal groups/clusters, is used then the i?200 
estimates (and the search radii) will be too small resulting in larger magnitude gaps. Such a shift in sample size will 
have a significant impact on the results of an FS number density studies. 

These two estimators also introduce scatter into many of the tight relations that FSs were found to fo llow . This can 
be seen by comparing the left panel of Figure IA2I to Figure [6] and the right panel to Figure [TT] Figure IA2I shows the 
optical luminosity of the galaxy system within 0.5i?2oo as a function of Tx (left panel) and the fraction of that light 
contained within the dominant galaxy as a function of Tx (right). Note that using A^oo to estimate i?2oo, as we do 
here, changes the radius within which we calculate L to t (and therefore changes L ga i/ L to t), but should not affect Tx 
because at these radii the Tx profile should be flat. In the case of the left plot the scatter in this relation (for both 
FSs and XCS clusters) increases from 0.2 dex to 0.25 dex when using A^oo to estimate i?200- Note that by using A^oo, 
we are also entirely removing any induced correlation between L tot and Tx in Figure [6] since the radius we used to 
calculate L to t is independent of Tx (see Section [5]). So the small increase in scatter in Figure A2 compared to Figure 
6 could be from removing this correlation. In the right plot, apart from the increased scatter, we also see a boosting 
of the fraction at high Tx because these systems are richer and therefore a small decrease in radius can reduce the 
number of galaxies used to calculate the magnitude gap more than for a low-Tx system. This results in a smaller Ltot 
and a correspo ndingly large r fract ion of the system light being con tained in these domin ant galaxies. 

If we use the Pones et al.l ()2003l ) magnitude gap rather than the iDariush et al.l (|2010h magnitude gap we find that 
less systems are classified as an FS (~ 30% less) and we lose some statistical power. There are no changes to the overall 
results of this study but we find that the tight trends we find for FSs (such as in Figure [TT] ) are now more sparsely 
populate d as some of our FS s have been re-classifi ed as non-F Ss. Interestingly, in the study of the magnitude gap 
made by IDariush et al.l (|2010f ) it was found that the lJones et al.1 magnitude gap was better at finding high-mass halos. 
We expected to lose mostly low-mass FSs when making this change but actually found that it was the intermediate 
mass FSs that fell out of the sample. 

In summary, we find that our definition of an FS is robust to changes to the cuts used in redshift and color space 
but that it is sensitive to changes in i?200- We, therefore, tested various estimates of i? 2 oo and found that the number 
of systems classified as an FS can vary by ±25%. All our results hold for the samples defined using various other 
estimates of i?2oo but there is increased scatter in the correlations. Our results are also insensitive to our choice of 
magnitude gap definition. It is, therefore, important to have accurate estimates of i?2oo when defining FS samples, 
especially if estimating their number density. 

B. NOTES ON INDIVIDUAL SYSTEMS 

In this appendix we provide notes on each of the 17 fossil systems, along with SDSS images with XCS contours 
overlaid and CMDs. In the CMDs, green triangles are galaxies that, based on their SDSS spectroscopic redshift, are 
considered system members (i.e. Acz < 2000 km s _1 ) while red triangles are those that are not. Black circles are 
galaxies that, based on their SDSS photometric redshift, are considered system members (i.e. Az p hot < 0.1). The 
blue diamonds mark the galaxies used to calculate Am\2 and Amu. The dotted lines are the color cuts employed 
when calcul ating the magnitud e gaps. Where possible velocity dispersions have been calculated using the bi-weight 
estimator of lBeers et al.l (fl990f ) and for those systems for which it was not possible to estimate a velocity dispersion 
values were obtained from the literature (if available). 
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T x /5keV 



Figure Al. A comparison of the i?soo— Tx relation from Arnaud ct al. (2005, solid line) to the relation determined from the Tx and R500 
data from Mantz ct al. (2010), dotted line). The stars are the data used in this paper (open for the XCS clusters and closed for the FS). 
Open circl es, triangles, and sq uares are the BCS, REFLEX, MACS clusters respectively, where we show the core excised temperatures and 
radii from lMantz et al.l H20ia j. 
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T x (keV) T x (keV) 

Figure A2. The optical luminosity of the galaxy system within 0.5-R200 as a function of Tx {left) and the fraction of the o ptic al luminosity 
in the system contained within the dominant galaxy as a function of Tx (right) these plots are th e same as Figure s |6l and 111 I but the R200 
used to define the fossil systems and calculate Ltot is estimated from the -/V200~^200 relation of IJohnston et al. (2007) and they exhibit 
larger scatter. 
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B.l. XMMXCS J015315. 0+010214.2 




This z = 0.0597 system is located at 01:53:15.0 +01:02:14.2. The CMD shows an obvious RS, which all spectroscopic 
members lie on, and the magnitude gap is 2.7 based on SDSS photometric data, i.e. not all of the four brightest galaxies 
have a spectroscopic redshift. However, the two galaxies that only have photometric redshifts both lie on the RS as 
well. The X-ray temperature of the system is XV = 1.1 keV and the X-ray emission peak lies ~ 8.6 kpc from the FG. 
The system is located near the edge of the SDSS footprint but at a distance of ~ 1.4i? 2 oo both Ami 4 and L to t should 
be unaffected. The X-ray source is extended with i?2oo = 0.66 Mpc or ~ 30i?go (Rgo is the radius within which 90% 
of the galaxy's light is contained, in this case the FG). The system has a velocity dispersion of 266 km s^ 1 based on 
18 galaxies. 



B.2. XMMXCS J030659.8+ 000824.9 
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This z = 0.0751 system is located at 03:06:59.8 +00:08:24.9. The CMD shows an RS and both spectroscopic 
members lay on it. The magnitude gap is 2.5 based on SDSS photometric data and the galaxy that was excluded 
has a spectroscopic redshift that is ~ 38000 km s" 1 away from the FG. Of the four galaxies used to calculate Amu, 
the two that only have photometric redshifts both lie on the RS as well. The X-ray emission has TV = 2.3 keV and 
the peak lies ~ 26.2 kpc from the FG. The system is located near the edge of the SDSS footprint but at a distance 
of ~ 7.0i?2oo both Ami4 and L to t are unaffected. The X-ray source is extended with i? 2 oo = 1.01 Mpc or ~ 90i?go 
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(the largest of any system). We note that this system is an outlier in many of the plots and despite satisfying all 
the criteria necessary to be classified as an FS we acknowledge that this classification is uncertain. The system has a 
velocity dispersion of 1082 km s _1 based on 13 galaxies. 



The Stellar Mass Assembly of Fossil Galaxies 
B.3. XMMXCS J073422.2+265143.9 



23 




This z = 0.0796 system is located at 07:34:22.2 +26:51:43.9. The CMD shows an RS and all spectroscopic members 
lay on it. The magnitude gap is 3.0 based on SDSS spectroscopic data, i.e. all of the four brightest galaxies have a 
spectroscopic redshift. The X-ray emission has TV = 1.1 kcV and the peak lies ~ 1.5 kpc from the FG. The system is 
located near the edge of the SDSS footprint but at a distance of ~ 10.0i?2oo both Ami4 and L tot are unaffected. The 
X-ray source is extended with i? 2 oo = 0.67 Mpc or ~ 30i?go. Reducing i? 2 oo by its error would increase the magnitude 
gap by 0.5. The system has a velo city dispersion of 411 km s _1 based on 18 galaxies. This syste m was classified as a 
fossil in lDi'az-Gimenez et al.l (|2008l ) and was an XMM target in a program to study fossil systems. iDi'az-Gimenez et al.l 
quote a virial radius of 1.6 Mpc (twice as large as that found here) and a velocity dispersion of 551 km s -1 . 

B.4. XMMXCS J083454.8+553420.9 




This z = 0.2412 system is located at 08:34:54.8 +55:34:20.9. There is only the slightest hint of an RS in the CMD. 
The magnitude gap is 3.0 based on SDSS photometric data. The X-ray emission has TV. = 3.9 keV and the peak lies 
^3.3 kpc from the FG. The X-ray source is extended with i? 2 oo = 1.17 Mpc or ~ 30i?go. 
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This z = 0.1121 system is located at 09:25:40.0 +36:27:11.1. The CMD shows an RS and all spectroscopic members 
lay on it. The magnitude gap is 2.8 based on SDSS photometric data. The rejected galaxy lies ~ 7800 km s _1 away 
from the FG. The X-ray emission has Tx = 3.0 keV and the peak lies ~ 26.1 kpc from the FG. The X-ray source is 
extended with i? 2 oo = 1.14 Mpc or ~ 40i?go. Increasing i? 2 oo by its error would decrease the magnitude gap by 0.3 but 
the system would still be classified as a fossil. We note that there is a bright galaxy just outside 0.5i?2oo that would 
change A77744 if it were included. The system has a velocity dispersion of 435 km s" 1 based on 22 galaxies. 

B.6. XMMXCS J 101703. 6+390250.7 




This z = 0.2056 system is located at 10:17:03.6 +39:02:50.7. The CMD shows the hint of an RS and both spec- 
troscopic members lay on it. The magnitude gap is 2.7 based on SDSS photometric data. The X-ray emission has 
Tx = 6.6 keV (the highest of any system) and the peak lies ~ 4.6 kpc from the FG. The X-ray source is extended 
with -R200 = 1-63 Mpc (the largest of any system) or ~ 30i?9o- We note that there is a bright galaxy just outside 
0.5i?2oo that would change Ami4 if it were included. The system has a velocity dispersion of 1013 km s^ 1 based on 
10 galax ies. This system is als o known as Abell 0963 and was an XMM target. From the literature, L x = 6.1 x 10 44 
erg s" 1 (jSoltan & Henry||1983l half of that found here), a = 1350 ± 200 km s~* (lEaverv fc Henrvill998l ). It is an X-ray 
lensing cluster that is unusually relaxed with < 5% substructure (jSmith et al.H2005lV 
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This z = 0.1381 system is located at 01:40:44.4 +39:57:10.4. The CMD shows an obvious RS, which all spectroscopic 
members lie on, and the magnitude gap is 3.1 based on SDSS spectroscopic data. The X-ray temperature of the system 
is TV — 3.5 keV and the X-ray emission peak lies ~ 3.2 kpc from the FG. The X-ray source is extended with -R200 = 1-22 
Mpc or ~ 25i?9o- The system has a velocity dispersion of 1248 km s _1 based on 18 galaxies. This system is also known 
as Ab ell 1068 and was an XMM target. It is also a cooling-flow cluster with Lx = 5 x 10 44 erg s _1 (|Quillen et al.l 
120081) . 

B.8. XMMXCS J123024. 3+111127.8 
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-z = 0.1169 T x =0.8 m t2 =2.1 m 14 = 3.5 




This z = 0.1169 system is located at 12:30:24.3 +11:11:27.8. There are very few galaxies in the vicinity of the FG, 
however, those that are have redshifts (one spectroscopic and two photometric) that place them at the same redshift. 
The CMD shows the hint of an RS, which both spectroscopic members lie on, and the magnitude gap is 3.5 (the largest 
of any system) based on SDSS photometric data. The X-ray temperature of the system is Tx — 0.8 keV and the X-ray 
emission peak lies ~ 15.6 kpc from the FG. The X-ray source is extended with i?2oo = 0.54 Mpc or ~ 20i?go, but is 
located close to the edge of an XMM field. We note that there is a bright galaxy just outside 0.5i?2oo that would 
change Amu if it were included. No velocity dispersion could be measured for this system. 



2G 



Harrison et al. 
B.9. XMMXCS J123338. 5+374114. 9 




This z = 0.1023 system is located at 12:33:38.5 +37:41:14.9. Again, there are very few galaxies in the vicinity of 
the FG, however, the CMD shows the hint of an RS. The only spectroscopic system member (the FG) lies on the RS 
and the magnitude gap is 3.2 based on SDSS photometric data. The rejected galaxy is ~ 11100 km s _1 away from 
the FG. The X-ray temperature of the system is Tx = 0.9 keV and the X-ray emission peak lies ~ 21.6 kpc from the 
FG. The X-ray source is extended with i?2oo = 0.58 Mpc or ~ 30i?9o- No velocity dispersion could be measured for 
this system. 



B.10. XMMXCS J124425. 9+164758.0 
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This z — 0.2346 system is located at 12:44:25.9 +16:47:58.0. The FG in this system has a double core, therefore 
we cannot trust the sky-subtraction correction and so have not applied it. This means that the magnitude gap of 
2.3 is a lower limit. If we applied the estimated correction the gap would be 3.3 and if we only applied the average 
correction from the other 16 FSs the gap would be 2.6, therefore we accept this as an FS. The X-ray temperature of 
the system is Tx — 1.3 keV and the X-ray emission peak lies ~ 25.8 kpc from the FG. The X-ray source is extended 
with -R200 = 0.63 Mpc or ~ 20i?9Q. No velocity dispersion could be measured for this system. 
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B.ll. XMMXCS J130749. 6+292549.2 




This z = 0.2406 system is located at 13:07:49.6 +29:25:49.2. There is the hint of an RS in the CMD, which the FG 
lies on, and the magnitude gap is 3.1 based on SDSS photometric data. There are no other spectroscopic objects in 
the vicinity of the FG, however, those objects that have photometric redshifts are at the same redshift. The X-ray 
temperature of the system is Tx — 3.2 keV and the X-ray emission peak lies ~ 18.9 kpc from the FG. The X-ray 
source is extended with i? 2 oo = 1-04 Mpc or ~ 25itW No velocity di spersion could be measured for this system. This 
system is also known as ZwCl 1305.4+2941. iGastaldello et all (2008) find T x = 3.17 + 0.19 KeV (fully consistent with 
our measurement) and Lx,5O0 = (1-25 ± 0.16) x 10 44 h^ 2 erg s^ 1 . 

B.12. XMMXCS J131145. 1+220206.0 
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This z = 0.1715 system is located at 13:11:45.1 +22:02:06.0. The CMD shows an obvious RS, which all spectroscopic 
members lie on, and the magnitude gap is 2.7 based on SDSS photometric data. The rejected galaxy is ~ 50000 km s _1 
away from the FG. The X-ray temperature of the system is Tx — 3.4 keV and the X-ray emission peak lies ~ 95.3 kpc 
from the FG. The X-ray source is extended with -R200 = 1-16 Mpc or ~ 25i?go. The system has a velocity dispersion 
of 362 km s" 1 based on 10 galaxies. This system is also known as MaxBCG J197.94248+22.02702. 
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B.13. XMMXCS J134825. 6+580015.8 
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This z = 0.1274 system is located at 13:48:25.6 +58:00:15.8. The CMD shows an obvious RS, which the only 
spectroscopic members (the FG) lies on, and the magnitude gap is 2.6 based on SDSS photometric data. The rejected 
galaxy is ~ 10000 km s _1 away from the FG. The X-ray temperature of the system is TV = 1.6 keV and the X-ray 
emission peak lies ~ 7.4 kpc from the FG. The X-ray source is extended with -R200 — 0.78 Mpc or ~ 20i?go, but it is 
located near the edge of the XMM chip. The system has a velocity dispersion of 526 km s _1 based on 5 galaxies. 

B.14. XMMXCS J 14-1627.7+231525.9 




r-band magnitude 



This z = 0.1382 system is located at 14:16:27.7 +23:15:25.9. The CMD shows an obvious RS, which all spectroscopic 
members lie on, and the magnitude gap is 2.9 based on SDSS photometric data. The rejected galaxy is ~ 10500 km 
s _1 away from the FG. The X-ray temperature of the system is Tx — 3.7 keV and the X-ray emission peak lies 
~ 13.9 kpc from the FG. The X-ray source is extended with i?2oo = 1-25 Mpc or ~ 25i?go- The system has a 
velocity dispersion of 646 km s _1 based on 21 g alaxies. This system wa s an XMM targe t and w as cl assified as an FS in 
Uones et al.l (|2003l) and subsequently studied in lVoevodkin et ail (|2010l ). ICvpriano et al.l (|2006l ). and lKhosroshahi et al.l 
(|2006allbh . It is also known as ZwCl 1413.9+2330. From the literature: Ami 2 = 2.4, L x = 2.2 x 10 44 h^" 2 erg s _1 , and 
T x = 1. 53 ± 0.35 keV (IJones et "all[200l : L x = 1.11 x 10 44 hf 2 erg s" 1 (ICvpriano et al.ll200ll : T x ~ 4 keV, a - 700 
km s" 1 dKhosroshahi et al.ll2006aD : and Am 12 = 1.7 L x = 6.09 x 10 43 erg s _1 ; i? 500 = 0.89 Mpc and a = 652 km s" 1 
(jVoevodkin et al.ll2010D . This system appears to be associated with XMMXCS J141657.5+231239.2. 
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This z = 0.1159 system is located at 14:16:57.5 +23:12:39.2. The CMD shows an RS and the magnitude gap is 3.1 
based on SDSS photometric data. We note that there is a bright galaxy just outside 0.5i?2oo that would change Ami4 
if it were included. The X-ray temperature of the system is Tx — 0.9 keV and the X-ray emission peak lies ~ 7.0 
kpc from the FG. The X-ray source is extended with -R200 = 0.56 Mpc or ~ 20i?go- No velocity dispersion could be 
measured for this system. This system appears to be associated with XMMXCS J141627. 7+231525. 9. 

B.16. XMMXCS J160129.8+083856.3 
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This z = 0.1875 system is located at 16:01:29.8 +08:38:56.3. The CMD shows no sign of an RS and the magnitude 
gap is 3.1 based on SDSS photometric data. Increasing -R200 by its error would decrease the magnitude gap by 0.3 but 
the system would still be classified as an FS. We note that there is a bright galaxy just outside 0.5i?2oo that would 
change A77114 if it were included. The X-ray temperature of the system is Tx = 1.7 keV and the X-ray emission peak 
lies ~ 18.1 kpc from the FG. The X-ray source is extended with -R200 = 0.77 Mpc or ~ 35i?9o- No velocity dispersion 
could be measured for this system. 
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B.17. XMMXCS J172010. 0+263724.7 




This z = 0.1596 system is located at 17:20:10.0 +26:37:24.7. The CMD shows an obvious RS, which all spectroscopic 
members lie on, and the magnitude gap is 2.45 based on SDSS spectroscopic data. This is close enough to 2.5 that 
we accept it as an FS. The two rejected galaxies are 2400 km s _1 and 20000 km s _1 away from the FG. Reducing the 
spectroscopic redshift cut to 1000 km s _1 would increase the magnitude gap by 0.4. The X-ray temperature of the 
system is XV = 5.5 keV and the X-ray emission peak lies ~ 20.2 kpc from the FG. The system is located near the edge 
of the SDSS footprint but at a distance of ~ 17i?200 both Ami4 and L tot are unaffected. The X-ray sourc e is extended 
with i?200 = 1-54 Mpc or ~ 30i?go. This system was an XMM target and was classified as an FS in iSantos et al.l 
(|2007h . It is also known as SDSS-C4 3072. The system has a velocity dispersion of 768 km s _1 based on 31 galaxies. 



